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Abstract 

We present a detailed proof of the Dovbysh-Sudakov representation for symmetric 
positive definite weakly exchangeable infinite random arrays, called Gram-de Finetti 
matrices, which is based on the representation result of Aldous and Hoover for arbitrary 
(not necessarily positive definite) symmetric weakly exchangeable arrays. 
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1 Introduction. 

We consider an infinite random matrix R = {Ri^)i^>\ which is symmetric, nonnegative 
definite in a sense that (Rij')i<i,i>< n is nonnegative definite for any n > 1, and weakly 
exchangeable, which means that for any n > 1 and for any permutation p of {l,...,n} 
the matrix (R P (i), P (i>))i<i,i'<n has the same distribution as {Ri,v)i<i y v< n - Following [6], we will 
call a matrix with the above properties a Gram-de Finetti matrix. Since all its properties - 
symmetric, positive definite and weakly exchangeable - are expressed in terms of its finite 
dimensional distributions, we can think of R as a random element in the product space 
M = rii<n' with the pointwise convergence topology and the Borel a-algebra M.. Let V 
denote the set of all probability measures on M.. Suppose that P G V is such that for all 
AeM, 

F(A) = I Q(u, A) dPr(u) (1.1) 
Jn 

where Q : Q x M. — > [0, 1] is a probability kernel from some probability space (Q, J 7 , Pr) to 
M such that (a) Q(u, •) G V for all u G Vt and (b) Q(-, A) is measurable on T for all A G M. 
In this case we will say that P is a mixture of laws Q(u, •). We will say that a law Q G V of a 
Gram-de Finetti matrix is generated by an i.i.d. sample if there exists a probability measure 
rj on £ 2 x ]R + such that Q is the law of 

(hi-hp + cuSw) (1.2) 
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where (hi,ai) is an i.i.d. sequence from 77 and h ■ h! denotes the scalar product on £ 2 . For 
simplicity, we will often say that a matrix (rather than its law on Ai) is generated by an 
i.i.d. sample from measure rj. The result of L.N. Dovbysh and V.N. Sudakov in [6] states the 
following. 

Proposition 1 A law P e V of any Gram-de Finetti matrix is a mixture U.l\) of laws in V 
such that for all uGfl, Q(u, •) is generated by an i.i.d. sample. 

Proposition [1] has recently found important applications in spin glasses; for example, it 
played a significant role in the proof of the main results in [3] and [IT] , where a problem 
of ultrametricity of an infinite matrix (Ri,i')i,i'>i was considered under various hypotheses 
on its distribution. For this reason, it seems worthwhile to have an accessible proof of this 
result which was, in fact, the main motivation for writing this paper. Currently, there are two 
known proofs of Proposition [TJ The proof in the original paper |6J contains all the main ideas 
that will appear, maybe in a somewhat different form, in the present paper but the proof 
is too condensed and does not provide enough details necessary to penetrate these ideas. 
Another available proof in [8] is much more detailed but, unfortunately, it is applicable not 
to all Gram-de Finetti matrices even though it works in certain cases. 

In the present paper we will give a detailed proof of Proposition [1] which starts with 
exactly the same idea as [8] . Namely, we will deduce Proposition [1] from the representation 
result for arbitrary weakly exchangeable arrays that are not necessarily positive definite, 
due to D. Aldous (pQ, [2]) and D.N. Hoover ([9], \1Q\), which states that for any weakly 
exchangeable matrix there exist two measurable functions / : [0, l] 4 — > R and g : [0, l] 2 — > R 
such that the distribution of the matrix coincides with the distribution of 

Ri t i = g(u,ui) and Ri )V = f(u,u h ui>,ui } i>) for l^l', (1.3) 

where random variables u, (ui), {uij>) are i.i.d. uniform on [0, 1] and function / is symmetric 
in the middle two coordinates, ui and ny. It is customary to define the diagonal elements 
as a function of three variables R^i = g(u,U[,vi) where (uj) is another i.i.d. sequence with 
uniform distribution on [0, 1]; however, one can always express a pair (ui, v{) as a function of 
one uniform random variable u\ in order to obtain the representation (JT73J. We will consider 
a weakly exchangeable matrix defined by (jl.3p and, under an additional assumption that it 
is positive definite with probability one, we will prove that its distribution is a mixture of 
distributions generated by an i.i.d. sample in the sense of (|1.2j) . First, in Section [2] we will 
consider a uniformly bounded case, |/|, \g\ < 1, and then in Section [3] we will show how the 
unbounded case follows by a truncation argument introduced in [6]. In the general case of 
Section [3] we do not require any integrability conditions on g rather than g < +00. Finally, 
to a reader interested in the proof of (11.31) we recommend a comprehensive recent survey [1] 
of the representation results for exchangeable arrays. 
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2 Bounded case. 



We will start with the case when the matrix elements \Rij>\ < 1 for all 1,1' > 1 with proba- 
bility one, so we can assume that both functions |/|, \g\ < 1. Of course, the representation 
of the law of R as the mixture (11.11) will be simply the disintegration of the law of (jl.3p on 
Ai over the first coordinate u. The main problem is now to show that for a fixed u the (law 
of) matrix R can be represented as (jl.2p . In other words, if we make the dependence of / 
and g on u implicit, then assuming that a weakly exchangeable matrix given by 

Ri,i = g{ui) and R l>v = f(ui,ui>,uiy) for I ^ V (2.1) 

is positive definite with probability one, we need to show that its law can be represented 
as (jl.2p . Our first step is to show that / does not depend on the last coordinate, which is 
exactly the same as Lemma 3 in [8]. 

Lemma 1 If R in A2. 1\) is positive definite with probability one then for 

f{x,y) = / f{x,y,u)du 
Jo 

we have f(ui, u 2 , Ui >2 ) = f(u\,u 2 ) a.s. 

Proof. We will give a sketch of the proof for completeness. Since (Ri t y) is positive definite, 
for any sequence of bounded measurable functions (hi) on [0, 1], 

1 n 

- V E'R^h^hM > (2.2) 
l,i'=i 

almost surely, where E' denotes the expectation in (ui). Let us take n = 4m and given two 
measurable sets A%, A 2 C [0, 1], let hi(x) be equal to 

I(x G Ai) for 1 < I < m, —I(x G Ax) for m + 1 < I < 2m, 

I(x G A 2 ) for 2m + 1 < / < 3m, -I(x G A 2 ) for 3m + 1 < / < 4m. 
With this choice of (hi), the sum over the diagonal terms I = I' in (12.21) is a constant, 




Off-diagonal elements in the sum in (12. 2p will all be of the type 

±IJf {x ,y,u L „)d X dy (2.3) 

AjXA jf 

and for each of the three combination A\ x Ai,A\ x A 2 and A 2 x A 2 the number of i.i.d. 
terms of each type will be of order n 2 , while the difference between the number of terms with 
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opposite signs of each type will be at most n/2. Therefore, by the central limit theorem, the 
distribution of the left hand side of (12. 2p converges weakly to some normal distribution and 
( 12.21) can hold only if the variance of the terms in ( 12. 31) is zero, i.e. these terms are almost 
surely constant. In particular, 

J J f(%, V, ui >2 ) dxdy=JJ f(x, y) dx dy 

A 1 xA 2 AixA 2 

with probability one. The same holds for some countable collection of sets A\ x A 2 that 
generate the product cx-algebra on [0, l] 2 and this proves that for almost all z on [0,1], 
f(x, y, z) = f(x, y) for almost all (x, y) on [0, l] 2 . 

□ 

For simplicity of notations we will keep writing / instead of / so that now 

Ri,i = g(ui) and Rij, = f{u h u v ) for I ^ V (2.4) 
is positive definite with probability one and |/|, \g\ < 1. 



Lemma 2 If R in \2.J$ is positive definite with probability one then there exists a measurable 
map (j) : [0, 1] — > B where B is the unit ball of I 2 such that 

f(x,y) = <f>(x)-<f>{y) (2.5) 

almost surely on [0, l] 2 . 



Remark. It is an important feature of the proof (similar to the argument in [6]) that the 
representation (12.51) of the off-diagonal elements is determined independently of the function 
g that defines the diagonal elements. The diagonal elements play an auxiliary role in the 
proof of (12.51) simply through the fact that for some function g the matrix R in (12. 4p is 
positive definite. Once the representation (12.51) is determined, the representation (II. 2p will 
immediately follow. 

□ 

Proof. Let us begin the proof with a simple observation that the fact that the matrix (12.41) 
is positive definite implies that f(x,y) is a symmetric positive definite kernel on [0, l] 2 , 

JJf{x,y)h(x)h(y)dxdy>0 (2.6) 

for any h G L 2 ([0, 1]). Since (Ri,i>) is positive definite, n~ 2 J2i i><nRi,i'h( u i)h(ui>) > and 
since \Ri,i\ < 1, the diagonal terms n~ 2 ^2 l<n Rijh(ui) 2 — > a.s. as n — > +oo. Therefore, if 
we define 

2 ^ 

S n=— f TT /] f{u U U V )h(ui)h(u V ) 

n(n — 1) 

v 1 \<KV<n 

then lim inf n^+oo S n > a.s. and (12. 6p follows by the law of large numbers for [/-statistics 
(Theorem 4.1.4 in [5]), the proof of which we will recall for completeness. Namely, if we 
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consider cx-algebra T n = ■ ■ ■ , U( n ), (ui)i >n ) where uq), . . . , U( n ) are the order statistics 

of ui,...,u n then (S , n ,^ r n ) is a reversed martingale and flnM-^" ^ s trivial by the Hewitt- 
Savage zero-one law since it is in the tail a-algebra of i.i.d. (ui)i>i. Therefore, a.s. 

0< lim S n = E(S 2 \ Pi Fn) =ES 2 

n>l 

which proves ( 12. 6ft . Since f(x,y) is symmetric and in £ 2 ([0, l] 2 ), there exists an orthonormal 
sequence (c/?j) in £ 2 ([0, 1]) such that (Theorem 4.2 in [T2] ) 

/(^ y) = 5^ A? Vii^Viiv) ( 2 -7) 

where the series converges in L 2 ([0,1] 2 ). By (12.61) . all A/ > and it is clear that now we 
would like to define <fi in (I2.5P by 

0(x) = (V^w(*)),> 1 :[O.l]- > ^- (2.8) 

However, we still need to prove that the series in (12. 7p converges a.s. on [0, l] 2 and that 
^2i>i <Pi{ x Y — 1 a - s - 011 [0; 1]> i- e - the map is indeed into the unit ball of £ 2 . For m > 1, 
let "D m be the cx-algebra on [0, 1] generated by the dyadic intervals [i2~ m , {i + l)2~ m ) so that 
V m C T> m+ i and cr(|J m>1 P m ) is the Borel a-algebra on [0, 1]. Let V m = V m ® V m be the 
product cx-algebra on [0, l] 2 and let 

fn(x,y) = ^ifi( x )fi(y)- 

l<l<n 

Since the series in (12.71) converges in L 2 ([0, l] 2 ), we can choose a subsequence (rij) such that 
the L 2 -norm \\f n . - f\\ 2 < j~ 2 . Therefore, ||E(/ nj |£> m ) - E(f\V m )\\ 2 < t 2 for m > 1 and, 
by the Borel- Cantelli lemma, 

lim E(f n] \V m )=E(f\V m ) &.s. (2.9) 
Since V m is the product cr-algebra, E(f nj \V m ) is equal to 

XiE(ifi l \V m )(x)E(ifi l \V m )(y) 

l<l<n,j 

and for x G [i2~ m , (i + l)2 _m ) we can define 

Mi+l)2~ m 

E( !f i\V m )(x) = 2 m w(s)ds. 

Ji2- m 

Therefore, for x and y in the same dyadic interval [i2~ m , (i + l)2~ m ), 

E(f n .\V m )(x,y) = k{nVi\V m )?{x) 

l<l<nj 
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and since \E(f\V m )\ < 1, implies that 



2 'x) < 1 a.s. 



lim V A,(E(^|X> m )) 

1— > + oo ' 

l<Z<rij 

The fact that all A/ > implies that for any n > 1 

^ Ai(E(^|^m)) 2 (^) < 1 a.s. 

l<Z<n 

By the martingale convergence theorem, E(^|P m ) — > a.s. as m — > +oo and, therefore, 
12i<n^iff( x ) — 1 a,s - Letting n — ► +oo implies 

= ^A^(x) < 1 a.s. (2.10) 

i>i 

so the map (f> in (I2.8p . indeed, maps [0, 1] into the unit ball of i 2 . Let us now show that (12.51) 
holds, i.e. the series in (\2.7\i converges a.s. Given n > 1, let us take rij > n and write 

|/-/n| < \f~fn s \ + \fnj-fn\- 

The first term goes to zero a.s. by the Borel-Cantelli lemma since ||/ n . — /||2 < j~ 2 and the 
second term can be bounded by 

\fn s (x,y) - f n {x,y) | = \ ^ \iipi(x)ipi{y) 

n<l<rij 

l>n l>n 

and, by (12.101) . also goes to zero a.s. as n — > +oo. Finally, the map is measurable since for 
any open ball B e (h) in £ 2 of radius £ centered at h, -1 (.B e (/i)) can be written as 



^2 h <Pi( x ) ~ 2 $Z h i^iVi(. x ) + h-h<e 



i>i i>i 

and the left hand side is obviously a measurable function. This finishes the proof. 

□ 

Lemma [2] proves that if (hi,t{) is an i.i.d. sequence from distribution rj = A o [tp^g)^ 1 on 
I 2 x M + then the law of R in (12. 4p coincides with the the law of 

(VM 1 - V)+*jMj,j'>i- (2 - n) 

To prove fll.2jl it remains to show that \\hi\\ 2 < ti a.s. and define a\ = t\ — ||/^|| 2 . 
Lemma 3 The measure rj is concentrated on the set {(h,t) : \\h\\ 2 < t}. 
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Proof. Suppose not. Then there exists (ho, to) such that \\h \\ 2 > t Q and such that the set 

A £ = B £ (h ) x (t - e, to + e) 

has positive measure r/(A e ) > for all e > 0. Let us take e > small enough such that for 
any (hi,ti), (h 2 ,t 2 ) G A E we have 

(hi ■ h 2 ) 2 > t 1 t 2 + e. 

Since r](A £ ) > 0, this contradicts the fact that for two independent copies (hi, ti), (h 2 , t 2 ) 
from distribution rj the matrix 

( ti hi-h 2 \ 
\ h- h 2 t 2 J 

is positive definite with probability one. 

□ 

3 Unbounded case. 

The idea of reducing the unbounded case to bounded one is briefly explained at the very 
end of the proof in [B] and here we will fill in the details. Let us define a map $jy : M — > M 
such that for T e M, 

// i\T \ 1/2 \ // N \V2 \ 
(Mr)V = r M ,min((— ) ,l)min(( f -) , 1 J (3.1) 

and, in particular, (^/^(r));^ = min(A^, T^i). Define a map ipN '■ ^ x K + — > £ 2 x M + by 

Vjv(/i,*) = ^min(^) 1/2 ,l),min(iV,t)). (3.2) 

Let us make two simple observations that follow from the definitions ( 13.11) and ( 13.21 ) : 

(a) if T is a Gram-de Finetti matrix then ^/^(r) is also a Gram-de Finetti matrix uniformly 
bounded by iV with probability one; 

(b) if T is a Gram-de Finetti matrix generated as in (12. lip by an i.i.d. sample (hi, U) from 
distribution v on £ 2 x M + then ^^(r) is generated by an i.i.d. sample (ipN(hi,ti)) from 
distribution v o ijj^f. 

Consider a Gram-de Finetti matrix R. Since ^n(R) is uniformly bounded, the results of 
Section [2] imply that it can be generated as in (12.111) by an i.i.d. sample from some measure 
i] N on £ 2 x M + . Since \im N ^ +OQ ^ N (R) = R a.s., this indicates that R should be generated 
by an i.i.d. sample from distribution rj defined as a limit of rj^. However, to ensure that 
this limit exists we first need to redefine the sequence (i]n) in a consistent way. For this, we 
will need to use the fact that a measure rj in the representation 02. Ill) is unique up to an 
orthogonal transformation of its marginal on £ 2 . 
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Lemma 4 // (hi,ti) and (h'^t'f) are i.i.d. samples from distributions r\ and rf on £ 2 x M + 
correspondingly and 

(/irMi-M+**M = ( 3 - 3 ) 

i/ien there exists a unitary operator q on £ 2 such that r\ — rj' o (g, id) -1 . 

Proof. Let us begin by showing that the values hi ■ hi and h\ ■ h\ can be reconstructed 
almost surely from the matrices (13.31) . Consider a sequence (gi) on £ 2 such that \\gi\\ 2 = U 
and gi ■ gy = hi ■ hy for all / < Without loss of generality, let us assume that 

9i = h + \/ti - \\hi\\ 2 ei 

where (e/) is an orthonormal sequence orthogonal to the closed span of (hi) (if necessary, 
we identify £ 2 with £ 2 © £ 2 to choose the sequence (e/)). Since (hi) is an i.i.d. sequence from 
the marginal fi of measure i] on £ 2 , with probability one there are elements in the sequence 
(hi)i>2 arbitrarily close to hi and, therefore, the length of the orthogonal projection of hi 
onto the closed span of (hi)i> 2 is equal to \\hi\\. As a result, the length of the orthogonal 
projection of gi onto the closed span of (gi)i>2 is also equal to \\hi\\ which means that we 
reconstructed \\hi\\ from the first matrix in (13.31) . Therefore, (I3.3P implies that 

((/i, ■/*)>(*!)) = ({K-h' v )M))- (3-4) 

Given (hi ■ hy) and (h[ ■ h[,), we can now construct sequences (xi) and (x' t ) isometric to (hi) 
and (h[) in some pre-determined way, for example, by choosing xi and x\ to be in the span 
of the first I elements of some fixed orthonormal basis. Then there exist (random) unitary 
operators U = U((hi)i>i) and U' = U'((hi)i>i) on £ 2 such that 

xi = Uhi and x\ = U%. (3.5) 

By the strong law of large number for empirical measures (Theorem 11.4.1 in [7]) 

- 6 ^t:) -> V and - 6 W -> rf 

l<l<n l<l<n 

weakly almost surely, and therefore, (13.51) implies that 

- Yl 5 (*uti) ^ V ° (U, id)- 1 and ^ ^ 5^ -> rj' o (U' , id)" 1 

l<l<n l<l<n 

weakly almost surely. Therefore, since (xi,ti) and (x'^t^) have the same distribution by (j3.4j) . 
7] o ([/, id) -1 and rf o (f/',id) _1 have the same distribution on the space of all probability 
distributions on I 2 x M. + with the topology of weak convergence. This implies that there 
exist non-random unitary operators U and U' such that rj o (U, id) -1 = rf o (If, id)" 1 and 
taking q = U^U 1 finishes the proof. 

□ 
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Using Lemma HJ we will now construct a "consistent" sequence of laws (tjn) recursively 
as follows. Suppose that the measure r] N that generates ^n(R) as in (12. lip has already 
been defined. Suppose now that ^n+i(R) is generated by an i.i.d. sample (hi,ti) from some 
measure t)n+i. Since 

* N (* N+1 (R)) = * N (R), (3.6) 

observation (b) above implies that ^n(R) can also be generated by ipN(hi,ti) from measure 
?7jv+i ° V'jv 1 - Lemma H] then implies that there exists a unitary operator q on I 2 such that 



Vn 



(r] N+1 o ip^) o (q, id) 1 = (r] N+1 o (q, id) o ^ 



since ip^ and (q, id) obviously commute. We now redefine tjn+i to be equal to r/ N+ i o (g, id) -1 . 
Clearly, ty N+ i(R) is still generated by an i.i.d. sequence from this new measure r/ N+ i and in 
addition we have 

r} N = r] N+1 oip- 1 . (3.7) 

Let A N := £ 2 x [0,N). Since ip N (h,t) G if and only if (h,t) e A N and ip N (h,t) = (h,t) 
on the consistency condition (13.71) implies that the restrictions of measures tjn and 
to An are equal. Therefore, t]n converges in total variation to rj = J^Vn UnAAiv-i- Since 
V^at = ipN ° ^iV' for iV < N', (13 ,7p implies that r/Ar = 77^' ip^ and since ^jy is continuous, 
letting N' — ► +00 gives t/n = i] o ip^ 1 . Finally, letting A/" — > +00 proves that is generated 
by an i.i.d. sample from 77 which proves representation (12. lip in the unbounded case, and 
Lemma [3] again implies ( 11. 2ft . 
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